A Two-stage Sieve Approach for Quote Attribution
نویسندگان
چکیده
We present a deterministic sieve-based system for attributing quotations in literary text and a new dataset: QuoteLi31. Quote attribution, determining who said what in a given text, is important for tasks like creating dialogue systems, and in newer areas like computational literary studies, where it creates opportunities to analyze novels at scale rather than only a few at a time. We release QuoteLi3, which contains more than 6,000 annotations linking quotes to speaker mentions and quotes to speaker entities, and introduce a new algorithm for quote attribution. Our twostage algorithm first links quotes to mentions, then mentions to entities. Using two stages encapsulates difficult sub-problems and improves system performance. The modular design allows us to tune either for overall performance or for the high precision appropriate for many use cases. Our system achieves an average F-score of 87.5% across three novels, outperforming previous systems, and can be tuned for precision of 90.4% at a recall of 65.1%.
منابع مشابه
A Sequence Labelling Approach to Quote Attribution
Quote extraction and attribution is the task of automatically extracting quotes from text and attributing each quote to its correct speaker. The present state-of-the-art system uses gold standard information from previous decisions in its features, which, when removed, results in a large drop in performance. We treat the problem as a sequence labelling task, which allows us to incorporate seque...
متن کاملExamining the Impact of Coreference Resolution on Quote Attribution
Quote attribution is the task of identifying the speaker of each quote within a document. While recent research has established large-scale corpora for this task, these corpora are not yet consistent in the way they handle candidate speakers, and many of the reported results rely on gold standard annotations of both entities and coreference chains. In this work we evaluate three quote attributi...
متن کاملTwo-stage Production Systems under Variable Returns to Scale Technology: A DEA Approach
Data envelopment analysis (DEA) is a non-parametric approach for performance analysis of decision making units (DMUs) which uses a set of inputs to produce a set of outputs without the need to consider internal operations of each unit. In recent years, there have been various studies dealt with two-stage production systems, i.e. systems which consume some inputs in their first stage to produce ...
متن کاملAn Integrated Approach for Measuring Performance of Network structure: Case study on power plants
Data envelopment analysis (DEA) and balanced scorecard (BSC) are two well-known approaches for measuring performance of decision making units (DMUs). BSC is especially applied with quality measures, whereas, when the quantity measures are used to evaluate, DEA is more appropriate. In the real-world, DMUs usually have complex structures such as network structures. One of the well-known network s...
متن کاملاثربخشی ساختارگرایی بر تعارضات زناشویی، سبکهای اسناد و بهزیستی اجتماعی
Introductoin: The main objective of theories of family therapy is focused on marital conflicts. Nevertheless marital conflicts still remains a complicated subject. Structural Family Therapy is a fundamental approach among family systemic theories that emphasize on creating a healthy organizational hierarchy in the family system. In system theory, conflict is considered as any dispute over the a...
متن کامل